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1 Introduction 



Cross-over experiments are widely used for comparing the responses to various different 
stimuli or treatments in areas ranging from psychology and human factor engineering to 
medical and agricultural applications; see, for example, the books by Jones and Kenward 
(2003), Ratkowsky, Evans and AUdredge (1992) and Senn (2002). Such experiments extend 
over a sequence of time periods. Each subject receives one treatment per period and an 
observation is made at the end of the period. The influence of a treatment on the subject's 
response may extend (or carry over) into the period following that in which it is administered. 
This is known as a first-order carry-over effect or first-order residual effect. In a simple 
statistical model for crossover studies, the response for a given subject in a given period is 
regarded as a sum of the effects of the subject, the period, the treatment given in this period 
(the direct effect of the treatment), the carry-over effect from the treatment given in the 
preceding period, and a random error. 

There is an extensive literature that assures us that a carefully designed crossover study 
can produce a wealth of information and that the parameters of interest can be estimated 
with high precision; see, for instance, Stufken (1996). This is based on the implicit, but 
critical, assumption that the experiment yields all planned observations. Yet, in many studies 
such as clinical trials, there is a substantial probability that some subjects will drop out of 
the study prior to completion of their treatment sequence. Low, Lewis and Prescott (1999) 
observed that a dropout rate of between 5% and 10% is not uncommon and, in some areas, 
can be as high as 25%. They give an example of a design in four periods, based on a 
WiUiams Latin square (WiUiams (1949)), where there is substantial loss of information if 
some observations are unavailable in period 4. Indeed, if all observations in the final period 
are not available, the design becomes disconnected, i.e., elementary contrasts are no longer 
all estimable. 
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It is important to note that a similar situation may arise in an interim analysis. When 
interim results on a cross-over experiment are analyzed, the interim design may consist of 
the planned design without the final several periods. 

The loss of connectedness is the most severe consequence of the unavailabihty of ob- 
servations. A general study of the loss of connectedness that results from unavailability 
of observations was done by Ghosh (1979, 1982). For crossover designs. Low, Lewis and 
Prescott (1999) formulated requirements for a planned design to be robust to dropouts in 
terms of the properties of the implemented designs that might result under a "completely- at- 
random" dropout mechanism (Diggle and Kenward (1994)). Godolphin (2004) also studied 
the problem of loss of connectedness of various designs, including crossover designs. 

An experimenter generally starts with a design, the planned design, that possesses desir- 
able properties, including high efficiency or optimality. If no subject drops out, the study 
yields the entire information that was envisioned at the planning stage. Dropouts, how- 
ever, lead to loss of information. The implemented design is the design that corresponds 
to all available observations, and this design can be identified only at the conclusion of the 
experiment. 

For the case of a Williams Latin Square of order 4 the expected information loss for various 
probability distributions of dropouts was studied by Low, Lewis and Prescott (1999). In this 
article, we focus on the maximum information loss that may be anticipated. For instance, 
in a study with four periods where subjects are expected to remain at least through the 
first three periods, dropouts, if any, would occur in the final period only. In this case 
minimal information is attained when all subjects drop out in the final period, which gives 
the minimal design. 

In this paper we assume that the planned design belongs to the class of Uniform Balanced 
Repeated Measurement Designs (UBRMDs). This is an important class of designs that have 
been studied extensively in the literature and are a popular choice in practice. UBRMDs 
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have elegant combinatorial balance and, under the simple model with additive i.i.d. errors 
with constant variance, possess various optimality properties; see, for example, Hedayat 
and Afsarinejad (1978), Cheng and Wu (1980), Kunert (1984), Hedayat and Yang (2003), 
and Hedayat and Yang (2004). (Refer to Stufken (1996) and Hedayat and Yang (2003) for 
additional references). A design is called uniform if (a) for each subject, each treatment is 
allocated to the same number of periods, and (b) for each period, each treatment is allocated 
to the same number of subjects. Furthermore, a design is called balanced for carryover effects 
{balanced, in short) if, in the order of application, each treatment is preceded by every other 
treatment the same number of times and is not preceded by itself. 

The goal of this research is to study the maximum loss of information and the result- 
ing loss of precision of the estimators that result from subject dropout when the planned 
design is a UBRMD. Since the maximum loss is attained by the minimal design, we study 
properties of this design, including its information matrix and efficiency. If the maximum 
loss of information is not deemed to be large, then the experimenter may conclude that no 
modification of the plan for the experiment is needed. On the other hand, if the loss is large, 
the experimenter should consider alternative strategies. 

We work in the same setup as Low, Lewis and Prescott (1999); in particular we assume a 
completely-at-random dropout mechanism. Also, we assume throughout that a subject who 
leaves the study does not re-enter. In Section 2 we derive general formulae for, and study 
the properties of, the information matrix of the direct effects of the minimal design when 
the planned design is a UBRMD, with subject dropouts occurring in the final m periods 
only. We examine the connectedness of the minimal design and, in particular, show that a 
UBRMD based on t > 5 treatments remains connected even when all observations in the 
final period are unavailable. We also develop measures for the maximum loss of precision due 
to subject dropout and the efficiency of the minimal design. In Section 3 we study the case 
of one-period dropout in more detail, including the special case when the planned design is 

4 



based on Williams Latin squares. Also in this section, we identify members of the class of 
UBRMDs for which the loss of information is small. 

The focus of this paper is to study certain properties of UBRMDs in the presence of 
subject dropout. For the broader problem of designing a crossover experiment in the presence 
of subject dropout one has to choose a planned design from a class (not necessarily the class 
of UBRMDs) of highly efficient designs for which the loss of information is small, and the 
corresponding minimal design is highly efficient. 

2 Setup and general results 

Consider a planned design with p periods, s subjects and t treatments. The simple model 
for the vector of response variables obtained from the implemented design can be written as 

y = + Xpa + Xdt + Xcp + e, (1) 

where e is the vector of random error variables, /3 is a vector of s subject effects, a is a vector 
of p time period effects, r is a vector of t direct treatment effects, p is a vector of t carry- 
over effects, and the X matrices are the corresponding design matrices. The treatments are 
labelled 0, 1, ...,t — 1. For the purposes of designing efficient experiments, all effects in the 
model are assumed to be fixed effects. 

We define the following incidence and replication matrices: Nsd — X'^Xd, Ngc — X'gXc, 
Npn = X'pXn, Npc = X'pXc , Nnc = X'^Xc, td = NdsU = Ndp\, tc = NcsU = 
Ncplp, where the "prime" denotes transpose and !„ is a vector of a unit elements. Also 
we define Jaxb = lal'b, Ja = Jaxa, Nji = Nlj (for i,j = S,P,D,C), rf, = diagiro), and 

— diag{rc)- Moreover, denotes an a x a identity matrix. We order the responses 
period by period for each subject in turn, so that, Xp — Is <S> Ip and Xs — Is <S> Ip. 

The joint information matrix for estimating the direct and carry-over (residual) treatment 

5 



effects is given by 



C 



Cll Ci2 

C21 C22 



(2) 



wfiere Cn ^ r}) -{ rijr^ NdsNsd NdpNpd 



C 



12 



ps 



P 



C22 — 



Tc + —rcr'c - -NcsNsc - -NcpNpc 
ps p s 

Ndc + —rDr'c- -NdsNsc - -NdpNpc. 
ps p s 



The information matrices for the direct effects and the carry-over effects, respectively, are 



— — <-^12<-^22*-^21 , 



R 



C22 — C'2lC'iiC'l2 



(3) 
(4) 



In this article we focus primarily on Cd- 

Throughout, we assume that the planned design dpian is a UBRMD with p = t time 
periods, s — gt subjects, based on t treatments and, in the implemented design dimp, all 
subjects complete their allocated treatment sequence in the first t—m periods (1 < m < t— 1). 
After the first t — m periods, subjects may start dropping out of the study completely at 
random. Since we assume that, once a subject drops out of the study, the subject will not 
return, the worst case scenario occurs when all subjects drop out at period t — m. The design 
c^min, composed of the first t — m periods of dpian, is called the minimal design. 

For even t, a Williams Latin Square gives a UBRMD, as docs any sequentially coun- 
terbalanced Latin Square (see Isaac, Dean and Ostrom (2001), for a survey). For t odd, 
a UBRMD cannot be constructed from one Williams Latin square, but such a design with 
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2t subjects can be constructed from a pair of squares. In addition, when i is a composite 
number, Higham (1998) has shown that there exists a UBRMD in t subjects, t periods and 
t treatments. The union of UBRMDs (identical or distinct) with the same value of t is a 
UBRMD. Here are some examples. 

Example 1 Three UBRMDs are shown below, where the columns show the treatment 

sequences and the rows correspond to the time periods. The design d2pian is a Williams 
Latin Square of order 4, while designs dipian and d^pian consist of a pair of Williams Latin 
Squares ior t — 3 and t — 5 treatments, respectively. 

1234034012 

3 

0123440123 



2340123401 

2 

4012301234 

1 

3401212340 

If m = 1, i.e. subjects may drop out in the final period only, then each array with the last row 
deleted gives the corresponding minimal design dmin- It can be verified that the information 
matrices of dmin will have rank 2, 1 and 4, respectively. So, as noted by Low, Lewis and 
Prescott (1999), the minimal design corresponding to d2pian is disconnected; indeed the only 
estimable direct treatment contrast in rf2min is tq — ti + T2 — T3. On the other hand the 
minimal designs corresponding to dipian and d^pian are connected, the former has a nonzero 
eigenvalue 0.125 with multiphcity 2; the nonzero eigenvalues of the latter are 2.61 and 3.73, 
each with multiplicity 2. 

Since UBRMDs with three periods have been studied in Jones and Kenward (2003) and 
Low (1995), henceforth we consider t > 4. The following lemma shows that dmin corresponds 



dipian 

1 2 2 1 
12 12 

2 112 



d2plan 

12 
12 3 
3 1 
2 3 
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to the maximal loss of information. For nonnegative definite matrices A and B we use A>z B 

to denote the fact that A — B is a. nonnegative definite matrix, the Lowner order. 

Lemma 2 cj"'"" >z C*'"^ h C^"^. 

This follows from known general results for linear models; for instance, it is a consequence 
of Theorem 2.1 of Hedayat and Majumdar (1985). Lemma 2 says that dmin has the "smallest 
information matrix" among all possibilities for dimp- The matrices, Cn, C22 and C12 for cimin 
are as given in Theorem 4. First, we need some notation. 

The P and U matrices. For j = 1, ...,t, Pj denotes a t x s matrix with {h,i) entry 1 if 
subject i receives treatment h in period j of dpian; it is otherwise. For j — 0, 1, ...,t — 1; 
/c = 0, 1, i — 1, let, Ujk = Pt-jPl_i.. Note that, since dpian is a UBRMD, 

Pjls^glt,P;it^ls. (5) 

Also, the entries of Ujk are nonnegative with row and column sums equal to g. Hence ^Ujk is 
a doubly stochastic matrix; in particular Ujj — git- The following lemma gives the properties 
of these matrices that we need. 

Lemma 3 If Ui, Um (not necessarily distinct) are t xt matrices such that ^Ui is doubly 

stochastic for each i = 1, ...,M, and ai, ...jaM are nonnegative real numbers, then for x £ 
with x'x — 1, x'UiX < g for i — 1, M, and x'{^ aiUi)'(^ aiUi)x < g^{^ ai)'^. 

Proof If we write Wi = ^Ui, then it follows from the properties of doubly stochastic 
matrices (see, for example, Bapat and Raghavan (1997, Chapter 2)) that x'WiX < 1. 
Also, x'{Y,aiWiy{Y.a^Wi)x = x' {^^aiajWlWj)x < ^ ^ a^a^ ^x^TiTTto^W^W^ < 
^ ^QiUj. The lemma follows. ■ 

Theorem 4 Let dpian be a UBRMD with t treatments, t time periods, s = gt subjects, and 
let dmin consist of the first t — m periods of dpian- Then for dmin; 
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(i) the information matrix for estimating direct and carry-over treatment effects is given 
by (3) and (4) with 



Cn- "'"-'"' -'"' j,^4^J.--J- U,., (6) 

t — m t — m t — m — — ^ 

J^fc = 0,...,»Tl— 1 



(7) 



C^2 = j^\{{t-mf-{t^mt- 
t-\{t-mf-{t^\)-m{m^\))\J,-j^ Ujk, 

j^k=0,...,m 
m— 1 ^ m— 1 m 

Cu = ^[(^ + 1)^. - tit] - E^.O+i) - ^EE^^'^ (8) 

i=0 j=0 k=0 

t>2m + 2 (9) 
t/ien a g-inverse of C22 is C22 = where 



A=^[{t- mf -{t + IM - ^ E E U.^- (10) 

j^fc=0,...,m 



Proof (i) Since (ipian is a UBRMD, every treatment appears s/t = g times in every period 
and s times in total. Also, in the order of application, each treatment is preceded by every 
other treatment the same number of times and is not preceded by itself. It follows that for 
the design d^^^, 

m—l m 

= Jtxs — E"^*"-?' ^'^^ = Jtxs — y^fi-j, 

j=Q j=0 
NdP = gJtx{t-m}, Ncp = [Ot gJtxit-m-1)] , 

Td = g{t-m)lt, rc = g{t-Tn-l)lt. 
where Ot is a vector with t zero elements. Inserting the above formulae into (E]) and using (5) 
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yields expressions Cn, C22 and C12 as in the statement of the theorem, after some algebra. 

(a) Using the relation C22lt = Oj, it can be verified that 6*22^" ^(^22 = (^22, as long as 
A'^ exists; hence is a g- inverse of €22- We now show that condition (9) guarantees the 
nonsingularity of A. It follows from the fact that the row sums of ^ ^ Ujk are gm{m + l), 

j^k=0,...,m 

and Lemma 3, that the minimum eigenvalue of A is 

Xr.in{A) = 73— [(t - mf - (t + 1) - m{m + 1)]. (11) 

This is positive if {t — m)^ — (t + 1) — m(m + 1) > 0, which is equivalent iot> 2m + 2. ■ 
Prom the proof of Theorem 4, (9) guarantees that A in (10) is positive definite, and 
hence it is sufficient for the nonsingularity of A. This condition plays a critical role in 
the derivation of bounds for the eigenvalues of C^™'". Also, it can be shown that (9) is a 
necessary and sufficient condition for rank{C22) = t — 1. The next result gives a bound on 
the eigenvalues of C^"'° which is used to study the loss of precision for the estimators of the 
treatment contrasts and the efficiency of dmin- 

Theorem 5 Suppose m >1 and t > 2m + 2 and dpian is a UBRMD with t treatments, t 
time periods, and s = gt subjects. Denote the eigenvalues of C^™" by g6o = 0,g6i, ...,g6t-i. 
For r — 1, t — 1, 9r > OL{t, m), where 

eL{t,m)^-^ 

t — m 

Proof Suppose x e R*, with x'x — 1, and x'lt = 0, such that C^™x — gOrX, r — 1, ...,t — l. 
Then g9r = x'CuX — x'Ci2A~^C2iX. The maximum eigenvalue of A~^ is 1/ {\xmn{A)) where 



{t - 2m) 



t{m + iy 
(t - mV -(t + 1)- mim + 1) 



(12) 
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Amin(^) is given by (11). Hence, g9r > x'CuX — t — K-nx'Ci2C2iX. If we write, 



m— 1 m— 1 m 



j=0 j=0 k=0 



then C21 = [gtit + V' - g{m + 1)7*] . So, 



gOr > x'CnX - ^--j- ^ [{gtf + gtx'iV + V')x + x'VV'x] . (13) 



The following inequalities can be derived by applying Lemma 3: 



x'Cnx > -^{t-2m) 
t — m 

x'{V + V')x < 2g [{t - m + l)m + m{m - 1)] = 2gmt 

x'VV'x < {gmtf . (14) 



Inserting them into (13) and using the fact Amm(^) > 0, which follows from the condition 
t > 2m + 2, we get a lower bound to 9r which, upon simplification, reduces to (12). ■ 

The results of Theorem 5 are useful in studying properties of c^min- Connectedness is 
a basic property of a design. A sufficient condition for dmin to be connected for direct 
treatment effects is 9L{t,m) > 0. It follows from Lemma 2 that dimp is connected whenever 
dmin is connected. Corollary 6 follows from (12). 

Corollary 6 Suppose dpian is a UBRMD with t treatments, t time periods, and s — gt 
subjects. A sufficient condition for the minimal design d-aan to be connected is that 

{t - 2m) [{t - mf - (t + 1) - m{m + 1)] - t{m + 1)^ > 0. (15) 



For a given m, it follows from (15) that dmin is connected if a polynomial in t of degree 
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3, which has leading coefficient 1, is positive. For m — 1, (15) reduces to — bt^ -\- A > 0, 
i.e., i > 5, and for m = 2 it reduces to — + 8t + 12 > 0, i.e., t > 8. These observations 
lead to the following result. 

Corollary 7 Suppose dpian is a UBRMD with t treatments, t time periods, and s = gt 
subjects. For each m > 1, there is a positive integer t*{m) such that the design dmin is 

connected if t > t*{m). In particular, for m = 1, dmin is connected whenever t > 5; for 
m — 2, drain is connected whenever t > 8. 

One way to measure the goodness of a connected design d is by the harmonic mean of 
the eigenvalues of the information matrix C^,, Hd — {t — 1) /trace{C%)'^ , where C+ denotes 
the Moore-Penrose inverse of C. This is the value of the A-criterion; hence Ha is a measure 
of the precision of estimators of the direct treatment contrasts for the design d. It follows 
from Lemma 2 that Ha^.^ < Ha-^^ < Ha^i^^ ■ Since dpian is the design that was chosen at the 
start of the experiment on the basis of its desirable properties, especially efficiency, it is of 
interest to examine the loss of precision in the implemented design dimp with respect to dpian 
due to subject dropout. This loss may be measured by 

<ii.p:<ipian ^^^^^^ trace{C'^-n+' 

Clearly, the maximum loss of precision due to subject dropout for dpia,^ is given by 

r - ^ tracejCj^n^ 

<i^.n.<ipu„ ^^^^^ trace{C''j^'-)+' 

When dpian is a UBRMD, we get using Theorem 5, 

t-1 



traceiCt'^y - ^4" - 



-^gOr g9L{t,m) 
12 



Prom Hedayat and Afsarinejad (1978), we obtain for the UBRMD dpian, 



^ t^-t-1 ^ t ^ ^ ' gt{t-2){t+l) 

Therefore we obtain the following result. 

Corollary 8 Suppose dpian is a UBRMD with t treatments, t time periods, and s — gt 
subjects. An upper bound to the maximum loss of precision due to subject dropout is given 
by MLd^^^^ < UML{t, m) where, 

with 9L{t,m) given by (12). 

For m = 1 and t > 5, the values of UML{t, 1) for selected values of t are given in Table 
1, where the planned design dpian is a UBRMD. Similarly for m = 2 and t > S, the values of 
UML[t,2) for selected values of t are given in Table 2. As one would expect, for fixed m, 
the bounds decrease with t and become reasonably small when t is considerably larger than 
t*{m). In general, (16) is conservative. Hence, the prospect of subject dropout may not be 
a big concern when t is much larger than t* (m) . 

If the UBRMD dpian is chosen to have certain combinatorial structures, the bound 
UML{t,m) can be improved. One such structure is considered next. 

Type Wm UBRMD. Suppose the subjects of the UBRMD dpian can be partitioned into g 
sets of t subjects each such that, within each group, every treatment appears once in each 
of the periods t — m,t — m + 1, t for fixed m > 1. Then for j, k — 0, m, j ^ k, 



gt{t - 2){t + 1) 1 ^ ...„„.^^p.an^+ {t -l){e-t- 1) 



Ujk - Pt-jPi-k - yj^jkl, 
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where each 11^^.; is a permutation matrix of order t and Pj is defined in Section 2. If for each 
j,k = 0, ...,m, j 7^ k and / = 1, ..,g, the eigenvalue 1 of Ujki has multiphcity one, then we 
say the UBRMD dpian is of type Wm- 

If m > 2 an UBRMD of type Wm is also of type Wm-i- Examples of UBRMDs of type 
Wt-i are UBRMD's that are cyclically generated, for instance the Williams Latin Squares 
and pairs of Williams Latin Squares given in Families 1 and 3 of Hedayat and Afsarinejad 
(1978), and the class of sequentially counterbalanced squares described by Isaac, Dean and 
Ostrom (2001). 

It is known that the eigenvalues of a permutation matrix 11 of order t are the roots of 
unity, — Cos{27rr/t) + i Sin(27rr/t), r = 0, 1, ...,t — 1, unless the permutation can be 
factored into the product of two or more disjoint cycles, in which case the multiplicity of 1 as 
an eigenvalue of 11 is larger than one (see, for example, Davis (1979)). If we set ip^, = Cos(^) 

then, for a UBRMD of type Wm, the eigenvalues of 11^^; + 11^^.; are 2t/j^, r = 0, 1, t — 1. 

g 

Since Ujk + Ukj = X] {^jki + ^jki) ; ^'t^ — ^^'^ ^'^ — 1 implies x'iUjk + Ukj)x < 2gip'^^. 



If we insert these inequalities into (13) we obtain the following result. 

Theorem 9 Suppose dpian is a UBRMD of type for fixed m > 1, t > 2m + 2. Denote 
the eigenvalues of Cf) by gOg — 0,g9i, ...,g6t-i. For r = 1, ...,t — 1, ^ ^z,(^)"^)) where 



Hence the inequalities (14) may be replaced by 



x'C^.x >^[{t- 2m) + ^ 
x'{V + V')x < 2gmti) 
x'VV'x < {gmtf . 



(1-V'i)] 



Olit,m) 



t 



{t - 2m) + 



m{m — 1) 



(i-V'i) 



i(l + 2V'im + m2) 



t — m 



t 



{t - m)2 -{t + 1)- m{m + 1) 



with ipi — Cos^. 
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Since ■0i < 1, > ^L(i,"^). Hence replacing 6L{t,m) by 9\{t,m) in (16) gives a 

sharper upper bound to the maximum loss of precision due to subject dropout when dpian is 
a UBRMD of type Wrn, i-e., ML^p,^^ < UML*{t,m) < UML{t,m), where 



t{t-2){t + l) 

For m — 1 and i > 5, the values of the upper bound UML*{t, 1) to the maximum loss of 
precision due to subject dropout when rfpian is a UBRMD of type Wm for selected values of 
t are given in Table 1. For m — 2 and t > 8 the values of UML*{t, 2) for selected values of 
t are given in Table 2. 

We now consider the efficiency of (imin, the design that corresponds to the worst case 
scenario. Let D{t, gt, t—m) denote the class of all connected crossover designs (not necessarily 
uniform or balanced) in t — m periods and gt subjects, based on t treatments, rfmin belongs to 
this class. Since for an arbitrary design d e D{t, gt, t — m), trace{Cf))^ ^ (^ ~ lY /traceCfj, 
a lower bound for trace{Cj))^ may be obtained from an upper bound of traceCjj. The latter 
bound can be obtained from Theorem 3 of Hedayat and Yang (2004) (which generalized a 
bound of Stufken (1991)) as follows, 

, '2(gt-5*) (t-m-l)S*^ 

Max traceCj) = gt{t - m - 1) ^ ^ ^ 



deD{t,gt,t-m) t — m g{t — m){t{t — m) — t — 1) ' 

where 5* is the nearest integer to . gince the choice 5* = gj^gg 

upper bound to the maximum, it can be shown that traceCf) < gMTr{t,m), where 

MTr{t, m)^t{t-m-l) ^ ' 



{t — m){t — m — 1) 
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A measure of the efficiency of dmin in D{t, gt, t — m) is 



Min traceiCky 

.d^in d&D{t,jt^t-n,) (t-l) 



papain _ ' ' i > 



D{t,gt,t-m) trace{C'j^''^^)+ ~ gMTr{t,m){trace{C'i^''^^)+)' 

It follows from Theorems 5 and 9 that, if we define 

{t-l)eL{t,m) , , it -l)e*r it, m) 

then the inequalities EFF^^;^'g^^^_^^ > EL{t,m) and EFF^j;^^^^^_^^ > EL*{t, m) give lower 
bounds to the efficiency of dmin when dpian is a general UBRMD and a UBRMD of type Wmi 
respectively. Note that both EL{t,m) and EL*(t,m) take values in (0, 1). 

For m = 1 and t > 5, the values of the lower bounds EL{t,l) and EL*{t,l) to the 
efficiency of dmin in D{t, gt,t — l) for selected values of t are given in Table 1. For m — 2 and 
t >8, the values of EL{t, 2) and EL*{t, 2) for selected values of t are given in Table 2. Since 
EL{t,m) (or EL*{t,m)) measures the efficiency of dmin over a// designs in D{t,gt,t — m), 
not just those that are derived from UBRMDs, high values of this efficiency bound suggest 
that a different starting design, instead of the UBRMD dpian, would not have resulted in 
a substantially better dmin- An UBRMD dpian that has a small value of UML{t,m) (or 
UML*{t,m)) and a large value of EL{t,m) (or EL*{t,m)) for dmin clearly is a good design 
for use when there is a possibility of subject dropout. 



3 Further results for the case m = 1 

In the previous section we derived upper bounds UML{t,m) and UML*{t,m) to the max- 
imum loss of precision due to subject dropout ML^^^^^- In this section, we first establish 
formulae for ML^^j^^for two special families of UBRMDs. Then we indicate how to select 
a starting design UBRMD dpian for which ML^^^^^ is small. For simphcity, we focus on the 
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case m — 1, i.e., subjects remain in the study at least through period t — 1. We start with 
definitions of the famihes of UBRMDs that we will study. 

Class A type Wi UBRMD: Start with atxt square W that is a UBRMD with columns 
denoting treatment sequences and rows denoting periods. Let Pt{W) and Pt-i{W) be the 
t X t matrices defined in Section 2 corresponding to periods t and t — 1, respectively, for 
square W, i.e., the {h,i) entry of Pj{W) is 1 if the entry of W is h; it is otherwise. 
Let n = Pt{W)Pt-i{Wy . If 1 is an eigenvalue of 11 of multiplicity one, then the design dpjan 
that assigns g subjects to each sequence (column) of W is called a Class A type Wi UBRMD. 

CIeiss B type Wi UBRMD: We start with two t x t squares Wi and W2 such that the 
t X 2t design {Wi W2) is a UBRMD with columns denoting treatment sequences and rows 
denoting periods. For (5 = 1, 2, let PtiWs) and Pt-i{Ws) be the t x t matrices defined as in 
the previous paragraph, and take II^ = Pt{Ws)Pt-iiWs)' ■ Suppose 1 is an eigenvalue of II^ 
of multiplicity one for each 5 = 1,2. Suppose also that Wi and W2 are complementary in the 
sense 112 = H'^. Then the design dpian that assigns g/2 subjects to each sequence (column) of 
W is called a Class B type Wi UBRMD. 

Note that, for ease of implementation of the study, the experimenter will generally assign 
several subjects to each of a small number of treatment sequences (see Jones and Kenward 
(2003), p 159). Examples of Class A type Wi UBRMD when t is even are the Williams 
Latin squares given in Family 1 of Hedayat and Afsarinejad (1978) with g subjects assigned 
to each sequence, and examples of Class B type Wi UBRMD when t is odd arc the pair 
of William squares given in Family 3 of Hedayat and Afsarinejad (1978) with g/2 subjects 
assigned to each sequence. 

Theorem 10 Suppose t > A. (i) If dpi^^ is a Class A type Wi UBRMD then, for the 
minimal design (imin that consists of the first t — 1 periods of dpi^n, the eigenvalues of C^°"° 
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are qOq — and g6r, where 



t-1 



t-2- 



2t{l + CosC-f)) 
t{t - 3) - 2Cos(2f ) 



r = l,...,t-l. 



(17) 



(a) If dpian is a Class B type Wi UBRMD then, for the design dmin that consists of the 
first t — 1 periods of (ipian, the eigenvalues of C^"'° are g6o = and g6r, where 



t-1 



t{l + Cos{^)f 
t{t - 3) - 2Cos(^ 



t-2- 



r^l,...,t-l. 



(18) 



Proof Write C/ = C/oi = PtPU- Then Cn = {gt{t - 2)/{t - l)){It - \Jt), Cu = -{l/{t - 

l)){gtIt-2gJt + tU), andC22 = A- {g{t^ -3t-2)/{t{t-l)))Jt, where A = {{gt{t - 3) / {t - 

t-1 

l))It — jzi{U+U'). Consider the spectral decomposition, U + U' = Y^'^rKK^ '^^^^ K^r — 1, 

r=0 

r = 0, 1, t - 1, h'^hq = 0, for r =^ q; ao = 2g, Hq = ^1*. For r = 1, t - 1, h'^lt = 0. Let 
7^ = (/j + ^U)hr, r — 0,1, ...,t — 1. It can be shown that 



9t 



1 

-[(t - 2)(/t - -Jt,t) - gtY,{9t{t - 3) - a.)-i7r7; 

r-=l 



Note that, by Lemma 3, < 2g'. Hence for t>A, gt{t — 3) — > gt{t — 3) — 2g > 0. 

(i) In this case, U — gll. For r = 1, t — 1, {U + U')hj. = ajij. implies a,. = 2g^j.,wh.ere 
= Cos{^). Since 7^ = (/* + H) /i^, 7^7^ = /i;(2/t + n + n')/ir = (2 + 2^^^ /ij/ir- Therefore, 
if we write 7^ = ^1*, 7* = (2 + 27/jX^/^ Ir = (2 + 2^',)"^^^ {h + H) /i^, r = 1, ...,t-l, then 
{7o' 7i' 7t-i} orthogonal and normalized basis of 7^*, and for r = 1, t — 1, 



gt 

t-1 



t-2 



t{2 + 2ij,) 
t{t - 3) - 2ip^ 



7r 



gt 

t-1 



t-2- 



2t(l + Cog(^)) 
t(t - 3) - 2Cos{^] 



which establishes (i). 

(a) Here U = |(Hi + H'J = U' . For r = l,...,i-l, {U + U')K = a^^^ implies = 2^^; 
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7r = (^It + hr — {1 + ipj.)^r- follows that, ho,hi, ...,hr are orthogonal and normalized 

eigenvectors of C^™'" , and 



/^dmin I, 9^ 

^ t-1 



t{l + Cosi^)y 
t{t - 3) - 2Cos(^; 



t-2- 



for r — 1, t — 1. This establishes (^zij. ■ 

For i = 4 it can be shown that 6i — 0,02 — 2.67, and 9s — 0. Hence dmin is not connected. 
For t > 5 however, it follows from Corollary 7 that dmm is connected. The following Corollary 
is immediate. 

Corollary 11 Suppose t > 5. If dpian is a Class A or Class B type Wi UBRMD then the 
loss of precision due to subject dropout is 

where the 9r's are given by (17) and (18) for Class A and B, respectively. 

Theorem 10 also gives a lower bound to the efficiency of dmin in D{t, gt, t—1), EFF^^^^^ > 
EL^^{t), where 

EL^^{t) 



MTr{t,l) Wr[er 



and where MTr{t,l) ^ t{t - 2) - The difference between the bounds EL^^{t) and 
EL*{t, 1) defined in Section 2 is that in the former (which applies to Class A or Class B type 
Wi UBRMD) exact values of the eigenvalues 9r are used while in the latter (which apphes 
to any type Wi UBRMD) these are replaced by the lower bound ^^(t, 1). 

Table 3 gives MLa^^^^ and EL^^{t) for Class A and Class B type Wi UBRMD for selected 
values of t. Note that the values of the maximum loss due to subject dropout ML^^^^^ are 
substantially lower than the upper bounds given in Table 1, while the values of the efficiency 
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bound EL^^{t) are higher. 

For g > 1, a. Class A type Wi UBRMD consists of g rephcations of a square while a Class 
B type Wi UBRMD consists ol g/2 rephcations of a pair of squares. The next result implies 
that the loss due to subject dropout may be smaller if distinct squares (or pair of squares) 
are used instead of replications. 

Corollary 12 Suppose t > 5. If rfpian is the union of g t x t Class A type Wi UBRMDs 
or g/2 t X 2t Class B type Wi UBRMDs that are not necessarily identical, then 



_ t{t-2){t + l) 



'''^imp'C^plan — ^min'^plan ^plan — ■/■/'■/■ 0\ f -f- _L 1 ^ 1 x ^ / ' 



1) ^ (^e. 



-1 



where the 9r's are given by (17) and (18) for Class A and B, respectively. 

Proof We sketch a proof for Class A type Wi UBRMDs; the proof for Class B type Wi 

9 

UBRMDs is identical. Suppose dmin = [Jc^i, where di is the minimal design for a Class A 

i=l 

type >Vi UBRMD design based on t subjects for i = 1, ■■■,g- It follows from Theorem 2.1 of 

Hedayat and Majumdar (1985) that, C^-"'" h f^C*. Take B = Cf^+^Jt, and Bi = C^j+j-^Jt, 

1=1 

for i = 1, g. Since each di is connected, the matrices Bg and B are positive definite. 

Clearly, B h Y^Bi. It follows that 

i=l 



-1 



where the first inequality is well known in matrix theory and the second is given in Bapat 
and Raghavan ((1997), Theorem 3.11.1). It can be shown that, B'^ = [C^'"")^ + \Jt, and 
= {C^y + iJt: for t = 1, ...,g. Hence,(C^-)+ :< {C'dV ■ This implies. 



i=l 



9 t-1 ^ ^ t-1 



trace (Cj-)* < 1^*™" (Ci)" = ^EE^ = ^E^ 
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The result follows. ■ 

For the setup of Corollary 12, it is clear that a lower bound to the efficiency of (imin in 
D{t, gt, i - 1) is 

EFF'^t9t,t-i) > 1)V (^MTr{t,l){El/9r)^ . 

Corollary 12 indicates that the use of distinct Class A or Class B type Wi UBRMDs 
instead of replications of the same design will not increase the maximum loss of precision 
due to subject dropout MLa^^^^. There are examples where ML^^j^^ actually decreases. In 
their Example 2, Low, Lewis and Prescott (1999) studied the case t = 4, s = 24, m = 1 and 
showed that the use of distinct William Squares instead of replications of the same square 
reduces ML^^^^^. An example for i = 6 is given below. The implication is that a UBRMD 
with more distinct sequences is likely to perform better under subject dropout. 

Consider the "extreme" design d^^^^^ that consists of one subject assigned to each of the t\ 
possible sequences (s = t!). For the case m = 1, it can be shown that Uqi = {{t — 2)!)[Jt — It] 
and the information matrix of the minimal design d^j^ is 



.e at{t-2)[{t-2y.] f 1 \ 



r - 5t=^ + + t - 2 
^3 - 4^2 + 3i + 2 ■ 

For dpig^j^ the maximum loss of precision due to subject dropout is. 



ML,e^^^ = 1 - a{e - t - l)/{{t - mt + 1)). 



Numerical studies indicate that this is the smallest value of MLd , among all UBRMDs 

"plan <-> 

with t\ or fewer subjects. We are currently investigating the nature of planned designs that 
attain the minimum and maximum values of ML^^j^^ , as well as designs that fall in between 
these extremes. However, as mentioned earlier in this section, a small number of treatment 
sequences is generally preferred, so it is doubtful that crossover designs with a large number 
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of distinct sequences will be used widely in practice. 

Example 13 Let t — 6 and s — 12go. The array below consists of two Williams Latin 
squares. 

Square 1 Square 2 

123450 251304 
012345 425130 
234501 513042 
501234 042513 
345012 130425 
450123 304251 

Suppose dpi^^ is a design that assigns 2gQ subjects to the first six columns of the array and 
^pian ^ design that assigns go subjects to each of the twelve columns. If m = 1, then the 
maximum loss of precision due to subject dropout are ML^i = 0.30 and MLj2 — 0.24. 

plan plan 

The design d^^^^^ can be constructed when gg = 60. For this design, ML^e^^^ = 0.20. 

Since estimation of the residual effects of the treatments is sometimes at least a secondary 
focus of experiments, we conclude this section with a brief consideration of the information 
matrix for the residual treatment effects of the minimal design d^am obtained from a UBRMD 
(ipian when m = 1. Note that, Cj'"'' t Cjp^. Also, Cj'"" = C22 - C'aiCfiCis. Suppose t > 3. 
Then, it can be shown that {{t — l)/{gt{t — 2))) It is a generalized inverse of Cn. Prom (2) 
and (4) we obtain. 



^d^.n _ ( atjt' - 5t + 5) 
^ - ' {t-l){t-2) 



It - \jt,t 



t-2 



[U + U'] 



^g{t-l){t-2)J \t(t-l)(t-2). 
Using this, it can be shown that if the design dpian is a Class A or Class B type Wi UBRMD 
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then dmin is connected for residual treatment effects whenever t — 3, or t > 5. For t — A, 
Low, Lewis and Prescott (1999) have shown that if dpian is a WiUiams Latin square then 
c^min is disconncctcd for the residual treatment effects. 
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TABLES 



Table 1: Upper bounds to the maximum loss of precision due to subject dropout 
and lower bound to the efficiency of the minimal design when m = 1 



t 


5 


6 


7 


8 


9 


10 


UML{t, 1) 


0.87 


0.48 


0.33 


0.25 


0.20 


0.17 


UML*{t,l) 


0.64 


0.40 


0.30 


0.23 


0.19 


0.16 


EL(t, 1) 


0.18 


0.66 


0.81 


0.88 


0.92 


0.93 


EL*{t,l) 


0.49 


0.76 


0.85 


0.90 


0.93 


0.94 



Table 2: Upper bounds to the maximum loss of precision due to subject dropout 
and lower bound to the efficiency of the minimal design when m = 2 



t 


8 


9 


10 


11 


12 


16 


UML{t, 2) 


0.90 


0.63 


0.48 


0.39 


0.33 


0.20 


UML*{t,2) 


0.81 


0.59 


0.46 


0.38 


0.32 


0.21 


EL{t,2) 


0.15 


0.50 


0.67 


0.77 


0.82 


0.93 


EL*{t,2) 


0.27 


0.55 


0.69 


0.78 


0.83 


0.94 



Table 3: Maximum loss of precision due to subject dropout 

and lower bound to the efficiency of the minimal design 
for Class A and Class B type Wi UBRMD dpian when m — 1 



t 


5 


6 


7 


8 


9 


10 


Class 


B 


A 


B 


A 


B 


A 


"-•-plan 


0.35 


0.30 


0.20 


0.18 


0.14 


0.13 


EL^'^it) 


0.90 


0.89 


0.97 


0.97 


0.98 


0.98 



25 



